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Abstract 



In this paper, we present an alternative approach to multiple inheritance 
for typed feature structures. In our approach, a feature structure can be as- 
sociated with several types coming from different hierarchies (dimensions). 
0\ ' In case of multiple inheritance, a type has supertypes from different hier- 

archies. We contrast this approach with approaches based on a single type 
hierarchy where a feature structure has only one unique most general type, 
and multiple inheritance involves computation of greatest lower bounds in 

a the hierarchy. The proposed approach supports current linguistic analyses in 

constraint-based formalisms like HPSG, inheritance in the lexicon, and knowl- 
edge representation for NLP systems. Finally, we show that multi-dimensional 
inheritance hierarchies can be compiled into a Prolog term representation, 
which allows to compute the conjunction of two types efficiently by Prolog 
r> ' term unification. 

1 Introduction 

In the past years, multiple inheritance has been increasingly used for the description 
of natural languages. Some examples are the work in Head-driven Phrase Structure 
Grammar (hpsg) on the structure of the lexico n |13[ [l9| , semantic sorts for selec- 
tional restrictions || , derivational morphology fr20|, [15|, and the syntax of English 
relative clauses [Ell . As an example of this type of analysis, figure [I] shows a recent 
HPSG description of English clauses]^] 

In the example, relative clauses are cross-classified according to two "dimen- 
sions"; on the one hand according to the Phrase-Type (Headed or Non-Headed), 
and on the other hand according to the Clause-Type (Interrogative, Declarative, or 

*This work was supported by the Deutsche Forschungsgemeinschaft, Special Research Division 
314 "Artificial Intelligence - Knowledge-Based Systems" through project N3 "Bidirectional Lin- 
guistic Deduction" (BiLD), the Commission of the European Communities through the project 
LRE-61-061 "Reusable Grammatical Resources," and Cray Systems. Part of the work was done 
during a visit at the Human Communication Research Centre, University of Edinburgh. I would 
like to thank the members of the Language Technology Group at HCRC, Chris Brew, Suresh 
Manandhar, Drew Moshier and Hans Uszkoreit for their comments. 

x This figure is taken from a slide presented by Ivan Sag at the 1993 EACL meeting in Utrecht. 
The use of horizontal lines in the hierarchy to indicate^ a split into several dimensions has been 
adopted from the book on the Core Language Engine H. 
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Figure 1: HPSG type hierarchy with multi-dimensional inheritance 

Relative). The choices within one dimension are mutually exclusive: no structure 
can be described as both Headed and Non-Headed, or as more than one of {Int, 
Decl, Rel}. However, a structure can be assigned types from different dimensions, 
without the need for a subtype that inherits from both dimensions. 

In section J2| we present a concept of typing, which provides direct support for 
multi-dimensional inheritance, and compare it to the type hierarchies in Carpenter's 
typed feature logic. Section |3| shows that unification in multi-dimensional inheri- 
tance hierarchies can be implemented efficiently as unification of a Prolog term 
representation of the types. Section |f| applies multi-dimensional inheritance to the 
problem of systemic classification. 

2 Multi-Dimensional Inheritance 



We follow the Carpenter's formalisation of typed feature logic, but modify the 
conception of the type hierarchy. In Carpenter's logic ||, the type hierarchy is 
required to be a bounded complete partial order (bcpo), which means that any two 
types which do have a common subtype must have a unique most general common 
subtype. A type hierarchy as in figure |l| fails this requirement because the types 
Headed-Ph and Rel have two common subtypes, but none of them is more general 
than the other. In order to make the type hierarchy a BCPO, additional types must 
be introduced, resulting in a hierarchy like the one in figure 0. In Carpenter's 
system, every feature structure has only one unique most general type, so that it is 
not possible to assign a feature structure two types neither of which subsumes the 
other unless they have a common subtypejj 

Carpenter [pi pagesl7-32] describes a conjunctive type construction by which a 
type hierarchy like the one in figure |l| can be converted into a bounded complete 
partial order like in figure |[ Once this is done, efficient algorithms for the calcula- 
tion of greatest lower bounds can be used g . We argue that such a conjunctive type 
construction is neither necessary for theoretical reasons nor for reasons of efficient 
implementation. 

In our system, a feature structure can have different types as long as they are 
chosen from different dimensions. Our syntax for subtype declarations, given in (fiT), 

2 A similar point holds for the type system of the logic programming language LIFE |3, 11, with 
the difference that the unification of two types need not have a unique result; the unification of the 
types Headed-Ph and Rel would create a choice point and produce the two alternative solutions 
5m- Wh-Rel and That- Rel. 
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Figure 2: HPSG type hierarchy with one-dimensional inheritance 



combines information about subtyping and disjointness. All the Yi are subtypes of 
X, and all Yi are disjoint. 



X>[Y 1 ,...,Y n ]. 



(1) 



Multi-dimensional inheritance arises in the case where there is more than one 
declaration with the same supertype on the left-hand side, as in (0). 



X > [Yi.i,...,Fi. 



(2) 



X > \Y„, 



i ^-m.k\ 



k ■ 



Instead of writing a separate subtype declaration for each dimension, multiple 
dimensions are conventionally connected with the product operator *, as in decla- 
ration (0), which is equivalent to the declarations in (0). 



X>[Y 1 . 1 ,...,Y 1 . n ]*...*[Y m 



i ■* m.fej ■ 



(3) 



Multiple inheritance is the case where some Y type occurs in the right-hand side 
of more than one type declaration. In this case, the type has several supertypes, 
which must be chosen from different dimensions in order to be consistent with each 
other.0 

We now turn to the semantics of type declarations. The denotation of each type 
is a subset of the domain. The semantics of the type declaration in (yj) is given by 
the axioms (Q), which states that the denotation of any of the Yi is a subset of the 
denotation of X , and (||), which states that all the Yi are disjoint. No additional 
axioms are needed in case of multi-dimensional inheritance. 



3 We ignore the case where one of the supertypes subsumes the other since such declarations 
are redundant. 



VTiQYi] C IX 'J) (4) 

VK ( vr, (k ( ^ r, =* [X] n K] - 0) (5) 

Our system has an open-world semantics for type hierarchies. A feature struc- 
ture can be described by two types from different hierarchies, but there need not be 
a common subtype of these two types. This is in contrast with a system like ALE 
|9[ with a closed-world semantics where the conjunction of two types is inconsistent 
unless one subsumes the other or they have a common subtype. 

sign > [non_headed_ph,headed_ph] * [int ,decl,rel] . 

headed_ph > [h_su,h_co,h_mk,h_f i] . 

int > [wh_int ,y_n] . 

rel > [wh_rel,non_wh_rel] . 

h_su > [su_wh_rel] . 

wh_rel > [su_wh_rel] . 

h_mk > [that_rel] . 

non_wh_rel > [that_rel] . 



Figure 3: Type declarations 

Our notion of feature typing and appropriateness is based on Carpenter's feature 
logic. Every feature is introduced for a unique most general type, and is appropriate 
for all subtypes of that type. In case of multiple inheritance, a type can inherit dif- 
ferent features from its supertypes in different dimensions. A difference arises with 
type restrictions for feature values. In Carpenter's system, the value of a feature 
has one type as the type restriction, whereas in our system, the type restriction can 
be a conjunction of types from different dimensions. 

In our system, the type hierarchy from figure [l] can be expressed directly with 
the declarations given in figure |3| 

Our notion of typing is similar to the one adopted in the Comprehensive Uni- 
fication Formalism, (cuf |ll[), in that it adopts an open- world semantics, and two 
types are considered as consistent unless they are explicitly declared to be disjoint. 
However, CUF allows to state more general type axioms using the full power of 
propositional logic. In our system, the type axioms are restricted to subtyping 
(which corresponds to implication) and disjointness. This restriction allows the 
efficient compilation of multi-dimensional inheritance hierarchies to Prolog terms, 
which will be described in the next section. 



3 Compilation into a Prolog Term Representation 

Multi-dimensional type hierarchies have the favourable property that the types can 
be compiled to a Prolog term representation. With this representation, Prolog's 
built-in term unification is all that is required to compute the conjunction of two 
types. The Prolog term representation given here builds on and extends the repre- 
sentation introduced by Mellish M, [17j and used in the Core Langauge Engine H 
and the ALEP grammar development system 0, H] . 

We start out by describing how the translation of type hierarchies into Prolog 
terms works, and then give an example. The translation to terms must be able to 
handle different dimensions of typing, mutually exclusive choices in a dimension, 
subtyping, multiple inheritance, features, and equality. 



Different dimensions: Each dimension occupies a different argument position in 
the resulting term representation, so that information from different dimen- 
sions can be combined by unification. 

Mutually exclusive types: Mutually exclusive types in the same dimension have 
different functors at the same argument position, so that their unification fails. 

Subtype: The term which corresponds to the subtype is a further instantiation of 
the term corresponding to its supertype. 

Multiple inheritance: The term which corresponds to the subtype is a further 
instantiation of the unification of the terms which correspond to its supertypes. 

Feature: The term representation has an argument position for each feature intro- 
duced for a type. If a feature is introduced for a subtype, then an argument 
position is provided in that argument which further instantiates the super- 
typefl 

Equality: In order to be able to distinguish structures that are identical from those 
which just happen to have the same value (i.e. their term representation is 
instantiated to the same ground term), an extra variable is introduced in the 
term representation (preventing instantiation to a ground term), which is only 
equal for two structures if they have been made identical by unification. 



Type Encoding 

Sign sign(_,_) 

Non-Headed-Ph sign(non_headed,_) 

Headed-Ph sign(headed_ph(_),_) 

Int sign(_,int(_)) 

Decl sign(_,decl) 

Rel sign(_,rel(_)) 

H-Su sign(headed_ph(h_su(_)),_) 

Wh-Rel sign(_,rel(wh_rel(_))) 

Su-Wh-Rel sign(headed_ph(h_su(su_wh_rel) ) ,rel(wh_rel(su_wh_rel) ) ) 



Figure 4: Encoding of Types as Prolog Terms 

Given this kind of representation, two typed feature structures can be unified 
simply by unification of the corresponding Prolog terms. In figure ^, we provide 
an encoding of the types in the hierarchy from figure [j] by Prolog terms. For 
simplicity, we leave out the argument position used to establish the equality of 
feature structures, and any argument positions used to encode features. Note that 
the type Su-Wh-Rel makes a choice in both dimensions. 

If the type Su-Wh-Rel has any subtypes, a choice must be made which of the 
two occurences of su_wh_rel in the Prolog terms should get argument positions 
for carrying this information.^ We always choose the leftmost occurence in a term 
for representing subtypes (and features). Further occurences then only serve to 
make a choice in a particular dimension of the hierarchy, and for that purpose, an 

4 The compilation of feature structures to Prolog terms has been described in \l'l\ [14, lfJ|, but 
these works assume untyped feature structures. One may wonder what the difference is between 
features and multiple typing dimensions, since they have very similar term representations. A 
technical answer would be that equality constraints (coreferences) can be stated over feature 
values, but not over type dimensions. Work by Moshier ]1 9] on a rational reconstruction of typed 
feature structures in domain theory approaches the question from a more fundamental perspective. 

5 From the correctness of the translation, it would be no problem representing the subtypes in 
both occurences, but such an encoding is clearly redundant. 



atom which is distinct from other terms that can occur as alternatives in the same 
dimension is sufficient. 



4 Application to Systemic Classification 

In this section, we apply the conception of multi-dimensional inheritance to systemic 
classification networks, which have been discussed in m&. 
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Figure 5: Winograd's systemic pronoun network 

Systemic classification networks are an interesting formalism for such an encod- 
ing because they offer considerable expressive power. Figure g| shows a systemic 
classification network for English pronouns taken from |23|| . 

Figure [| shows the connectives, and the translation of the first three in our sys- 
tem. The final connective (disjunctive entry condition) has no simple translation. 
This is not surprising, given the complexity analysis by Brew J7|] who shows sys- 
temic classification to be NP-hard by giving an encoding of the 3SAT-problem in a 
systemic network with disjunctive entry conditions. 

We treat disjunctive entry conditions by introduction of new types into the 
hierarchy. For each type X at the right-hand side of a disjunctive entry condition, 
we introduce two new types, X' for the original type, and ^X for its negation. 
These pairs of new types are introduced in different dimensions at the top of the 
choice system containing the disjunctive entry conditions. All types Y which have 
subtypes that are on the left-hand side of the disjuntive entry condition for X 
become subtypes of X' , and all other types become subtypes of ->X. In effect, this 
is an expansion of the disjunctive entry conditions to disjunctive normal form. 

In the worst case, this method can lead to translations which are exponentially 
larger than the original classification network, as for the 3SAT problem. We show by 
an example that this need not be the case in practice by converting the classification 
of pronouns given in figure M into our system (figure fij) . 

In this case, our translation is more efficient than the "brute-force translation" 
described in pa for networks with disjunctive entry conditions. In Winograd's 
systemic network, there are 54 possibilities (6 for interrogative pronouns, 36 for 
personal pronouns, and 4 for demonstrative pronouns), resulting in a brute- force 
translation with 55 arguments. [] 

6 The "brute-force translation" works by encoding n possibilities into a term with n + 1 argu- 
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Figure 6: Systemic network connectives and multi-dimensional inheritance 

In contrast, in our type system, the Prolog term translation has at most 8 
nodes in the worst case, as in the following term which encodes the subjective case 
masculine (third person singular) personal pronoun. 

pronoun (case (personal (third (masc) ) .subjective) ,num(personal,sg) ) 

5 Conclusion 



We have presented a concept of inheritance which provides direct support for current 
linguistic descriptions making use of "cross-classification" , and can be compiled into 
an efficient Prolog term representation. 

Given the need for multiple dimensions in lingistic descriptions, we believe that 
multi-dimensional type hierarchies will remain important even when their compila- 
tion into Prolog terms is not needed any longer because unification of typed feature 
terms will be built-in in future logic programming languages. 

For the time being, however, the combination of multi-dimensional inheritance 
and compilation into Prolog terms appears to give both the efficiency and the expres- 
sive power needed to develop larger-scale grammars and lexicons, and use existing 
Prolog-based technology (DCG parsers, left-corner, head-corner, or chart parsers, 
semantic-head driven or tabular generators) to build NLP systems. Such an ap- 
proach can benefit from all the advantages of modern Prolog compilers (indexing, 
coroutining facilities, module systems etc.) that would need considerable effort to 
duplicate in a dedicated grammar formalism. 

The multi-dimensional inheritance described in this paper is implemented in the 
system ProFIT [n2|, which translates programs containing typed feature terms to 

merits, whose first and last argument are different (either instantiated to different atoms or related 
by an inequality constraint). For encoding a value that excludes the n th possilibity, the n and 
n + 1 st argument position are unified. If a combination of values excludes all possibilities, then all 
arguments are unified with each other, including the first and the last, which are different, so that 
the unification fails. There is limited use for this type of translation to avoid creation of choice 
points in disjunctions with a small number of possible values. This can be regarded as a Prolog 
implementation of finite domains. 
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Figure 7: Pronoun hierarchy with disjunctions moved to the top 



ordinary Prolog programs. 
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